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Abstract 

Given two functions f,g: {0, 1}” — )• {0,1} a mapping 'll;: {0, 1}” — )• {0, !}"■ is said 
to be a mapping from f to g \i it is a bijection and f{z) = g{'tp{z)) for every z G {0,1}"'. 
In this paper we study Lipschitz mappings between boolean functions. 

Our hrst result gives a construction of a C-Lipschitz mapping from the Majority 
function to the Dictator function for some universal constant C. On the other hand, 
there is no n/2-Lipschitz mapping in the other direction, namely from the Dictator 
function to the Majority function. This answers an open problem posed by Daniel 
Varga in the paper of Benjamini et al. (FOGS 2014). 

We also show a mapping (p from Dictator to XOR that is 3-local, 2-Lipschitz, and 
its inverse is 0(log(n))-Lipschitz, where by L-local mapping we mean that each output 
bit of the mapping depends on at most L input bits. 

Next, we consider the problem of finding functions such that any mapping between 
them must have large average stretch, where the average stretch of a mapping cp is 
dehned as avgStretch(())) = E3;^j[dist(0(x), p{x + ei)\. We show that any mapping p from 
XOR to Majority must satisfy avgStretch^())) ^ c-syn for some absolute constant c ^ 0. 
In some sense, this gives a “function analogue” to the question of Benjamini et al. 
(FOGS 2014), who asked whether there exists a set A C {0,1}” of density 0.5 such 
that any bijection from {0,1}”“^ to A has large average stretch. 

Finally, we show that for a random balanced function /: {0,1}” —>■ {0,1}” with 
high probability there is a mapping p from Dictator to / such that both p and p~^ 
have constant average stretch. In particular, this implies that one cannot obtain lower 
bounds on average stretch by taking uniformly random functions. 
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1 Introduction 


Given two functions f,g: {0,1}" —)• {0,1} a mapping {0,1}” —)■ {0,1}” is said to be a 
mapping from f to g ii \s a. bijection and for every z G {0,1}"' it holds that f{z) = gigp^z)). 
Of course, if E[/] = E[ 5 f], then there are many mappings from / to g, and we can further ask 
whether there are “simple” mappings from / to g, where “simple” can mean, for example, 
that is computable by a small circuit, or has some other nice structure. In this paper we 
ask about the existence of Lipschitz mappings between some well studied boolean functions, 
including the functions Dictator, Majority, XOR, and a uniformly random balanced function. 

As a hrst example suppose that / is obtained from g by renaming the coordinates. Then, 
trivially, there is a 1-Lipschitz mapping from f to g, which simply permutes the coordinates, 
and in particular, each output bit of the mapping depends on exactly one input of its input 
bits. 

In some sense existence of a Lipschitz mapping between / and g implies some similarity 
between them because such a mapping induces 

• a Lipschitz bijection from /~^(0) to 

• a Lipschitz bijection from to and 

• a Lipschitz mapping from the cut in {0,1}” dehned by / to the cut dehned by g. 

Below we summarize the results shown in this paper. 

Bijections between Dictator and Majority 

It is a recurring theme in the analysis of boolean function that the Dictator function and the 
Majority function are in some senses, opposites of one another. For example, the Majority 
is Stablest theorem [MOOlO] states that if the noise stability of a function signihcantly 
deviates from the noise stability of Majority, the function must have an influential coordinate, 
and hence is non-trivially correlated with the corresponding Dictator function. Another 
example is the theorem of Bourgain [Bou02] (see also a recent improvement by Kindler 
and O’Donnell [K012]) saying that if the Fourier transform of a function deviates in an 
appropriate sense from that of Majority, then the function can be approximated by a junta, 
i.e., essentially depends on a small number of coordinates, which also implies some correlation 
with a Dictator function. 

Motivated by questions related to lower bounds on sampling by low-level complexity 
classes [Viol2], Lovett and Viola [LV12] suggested to further explore the differences between 
the two function, and asked whether it is true that any bijective mapping 0: Dictator~^(l) —)• 
Majority”^(1) must have a large average stretch, where by average stretch we refer to the 
quantity 

avgStretch(0) = 0(|/))], 

with X ~ G Dictator“^(l) denoting a random edge in {0,1}” such that Xi = yi = 1. 
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The question has been answer negatively in [BCS14] in a stronger sense, where there was 
shown an explicit bi-Lipschitz bijection that maps {0,to the upper half of {0,1}"^ (or 
equivalently a mapping from Dictator“^(l) to Majority~^(l)). In the same paper the following 
problem has been raised. 

Problem 1.1. Let n be odd. Is there a hi-Lipschitz bijection f : {0,1}” —)■ {0,1}"' that maps 
the half cube to the Hamming ball? In other words, is there a bi-Lipschitz bijection between 

Majority to Dictator? 

It is easy to see that there is no C-Lipschitz bijection from Dictator to Majority for 
C < n/2. Indeed for any bijection (j) from Dictator(a;) = Xi to Majority consider x G {0,1}” 
such that (j){x) = (1,1,, 1), and let y = x — Ci. Then, the weight of (j){y) must be at most 
n/2 since Majority(0(j/)) = Dictator(?/) = 0, and thus dist('i/(x),'^(y)) > n/2. 

In the other direction the answer is not as obvious, and we resolve this question in this 
paper. Specihcally, we prove the following theorem in Section 2. 

Theorem 1. For all odd integers n G N there exists a C-Lipschitz bijection t/-. {0,1}” —)■ 
(0,1}” from Majority to Dictator, where C* G N is some absolute constant. 

As mentioned above, it has been shown in [BCS14] that there exists a bi-Lipschitz bi¬ 
jection that maps the upper half of {0,1}” to {0,1}”“^. Therefore, there exists a bijection 
from {0,1}” to {0,1}” that maps the upper half of {0,1}” to {x G {0,1}”: Xi = 1}, and 
maps the lower half of (0,1}” to {x G {0,1}”: Xi = 0}, that is Lipschitz on the upper half 
of the hypercube, and on the lower half of the hypercube. However, it was not clear how to 
“stitch” these two bijections so that the endpoints of the edges in the middle layer will also 
be mapped close to each other. Theorem 1 says that this is indeed possible. 

Bijections between Dictator and XOR 

We further study the notion of mappings between boolean function by studying mappings 
between the Dictator function and the XOR function. For this question the well known 
mapping (j){xi,X 2 , ■ ■ ■, Xn) = (xi -|- X 2 , X 2 + x^,..., Xn-i + Xn, Xn) is clearly a bijection from 
Dictator(x) = Xi to XOR. Note, however, it is not bi-Lipschitz, as flipping the kth bit 
in the output changes it preimage in the hrst k coordinates. That is if ?/ = 4>{x), then 
+ ek) =x + Yl\=iei- 

In fact, it is not difficult to come up with a bi-Lipschitz mapping from Dictator to XOR. 
Indeed, define fj^xi, X 2 , ■ ■ ■, Xn) = (XOR(a;), 0 : 2 , ..., Xn), It is easy to check that ^|J is indeed 
a 2-bi-Lipschitz mapping from Dictator(a:) = xi to XOR. It makes sense, however, to ask for 
more, namely, does there exist a bi-Lipschitz mapping from Dictator to XOR that is in NC°, 
i.e., each of its output bits depends only on a constant number of input bits.^ 

We prove the following theorem in Section 3. 

^Note that the inverse mapping, namely a bijection from XOR to Dictator cannot be local since the 
dictating output coordinate must be the parity of all input bits, and thus depend on all of them. 


3 



Theorem 2. There exists a Lipschitz mapping 0 from Dictator to XOR such that each of 
its output bits depends on at most 3 input bits, 0 is 2-Lipschitz, and its inverse 0“^ is 
0{\og{n))-Lipschitz. Furthermore, the mapping (p is a linear operator over GF{2). 

Bijections between Majority and XOR 

In the paper [BCS14] the authors asked whether there exists a subset A C {0, of 
density 1/2 such that any bijection from {0, !}"■ to A must map endpoints of many edges of 
the hypercube far apart. Specihcally, for a mapping 0: {0, 1}" A they dehne the average 
stretch of (p as avgStretch(0) = Ea,g{o,i}",ie[n][dist(0(x), 0(a: + e*)] and pose the following 
problem. 

Problem 1.2. Is there a subset A C {0, of density 1/2 such that any bijection 

0: {0,1}"' —)■ A has avgStretch(0) = a;(l). 

We remark that we are not aware of the existence of a subset A C {0, of density 

1/2 such that any bijection /: {0,1}” —)■ A has avgStretch(/) > 2.1, and we hnd this open 
problem very interesting. It also makes sense to relax Problem 1.2 to an appropriate 2-set 
version, where we ask for two sets A,B & {0,1}" of density 1/2 such that any bijection 
f: A ^ B has large average stretch in the appropriate sense. 

Below we give a positive answer to the “function analogue” of this question. Specihcally, 
we show that any bijection from XOR to Majority must have large average stretch. 

Theorem 3. Any mapping 0: {0,1}" —)■ (0,1}" from XOR to Majority must satisfy 

avgStretch(0) > c^/n 


for some absolute constant c > 0. 

On the other hand, there exists a C-Lipschitz mapping 0: {0,1}" —)■ (0,1}" from Majority 
to XOR for some absolute constant C. 

We prove Theorem 3 in Section 4. 

Bijections between Dictator and a random balanced fnnction 

We also show that for a random balanced function /: (0,1}" —)■ {0,1} with high probability 
there is a mapping 0 from the Dictatorship function to / such that both 0 and 0“^ have 
constant average stretch. The proof of Theorem 4 appears in Section 5. 

Theorem 4. Let f: {0,1}" —)■ {0,1} be a uniformly random balanced boolean function. 
Then, with probability 1 — there exists a mapping (p\ {0,1}" —)■ {0,1} from Dictator 

to f such that for 1 — 0{l/n) fraction of x & {0,1}" it holds that dist(x,/(x)) < 2. In 
particular, <p satisfies avgStretch(0) = 0(1) and avgStretch(0“^) = 0(1). 

This implies that for two random balanced functions with high probability there is a 
bijective mapping between them such that both the mapping and its inverse have constant 
average stretch. 
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Corollary 1.3. Let f,g: {0,1}"' —>■ {0,1} be two uniformly random balanced boolean func¬ 
tion. Then, with probability 1 — there exists a bijection (f: {0,1}" —)■ {0,1} from f to 

g that satisfies avgStretch(0) = 0(1) and avgStretch((;/)“^) = 0(1). 

Indeed, let 0/,0g be bijections given by Theorem 4 when applied on / and g respec¬ 
tively. Then it is easy to see that the composition of (f)g with the inverse of 0/ gives us the 
desired mapping (f = cfg o 0“^. Indeed, since 0/ is a bijection, by Theorem 4 it satishes 
Pr xe{o,i}"[dist(0/(x),x) < 2] = 1 — 0(l/'n,), and hence 

Pr[dist(0g(0y^(x)),x) >4] < Pr[dist(0g(0y^(x)), 0y^(x)) > 2] -h Pr[dist(0J^(x), x) > 2] 

= 0(l/n). 

Therefore, for 0(l/n) fraction of the edges it holds that dist(0(x), 0(x -|- e*)) < 9. For the 
remaining 0(l/?7,) fraction of the edges their endpoints are trivially mapped to distance at 
most n and so the average stretch of 0 is 0(1), as required. 


1.1 Notation 


The functions used in this paper are the following. The function Majority: {0,1}" {0,1} 

is dehned as 


Majority(x) 


1 if > ^^72 

0 otherwise. 


The function Dictator: {0,1}" {0,1} is dehned as Dictator(x) = xi, i.e. its value is 

dictated by the hrst coordinate. The function XOR is dehned as XOR: {0,1}" —)■ {0,1} is 
dehned as XOR(x) = (mod 2). 

A mapping 0: {0,1}" {0,1}" is said to be C-Lipschitz if for every x,y E {0,1}" 

it holds that dist(0(x), 0(j/)) < Odist(x, y), where dist(-, •) denotes the Hamming distance 
between the strings. 

Note that in order to prove that a mapping 0 is C-Lipschitz it is enough to show that 
for every edge of the hypercube (x, x -|- e*) it holds that dist(0(x), 0(x -f e*)) < C. 

As a relaxation of the notion of being C-Lipschitz dehne the average stretch of 0 as 
avgStretch(0) = E3,^j[dist(0(x), 0(x -|- Cj)]. This means that if avgStretch(0) is large then 
many edges of the hypercube far apart, while if it is small, then the endpoints of an average 
edge are mapped by 0 close to each other. 


2 A Bijection from Majority to Dictator 

In this section we prove Theorem 1. The proof is based on the idea from [BCS14], which relies 
on a classical partition of the vertices of {0,1}" to symmetric chains, due to De Bruijn, Teng- 
bergen, and Kruyswijk [BvETKSl], where a symmetric chain is a path (c^, c^+i,..., Cn-k) in 
{0,1}", such that each q has Hamming weight i. 
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De Bruijn, Tengbergen, and Kruyswijk [BvETKSl] suggested a recursive algorithm that 
partitions {0,1}” to symmetric chains. We will follow the presentation of the partition 
described in [vLWOl] (see Problem 6E in Chapter 6), and we shall call it the BTK partition. 
We describe the partition by specifying for each x G {0, the chain Cx that contains x. 

The algorithm is iterative. During the running of the algorithm, every coordinate of x 
is either marked or unmarked, where we denote a marked 0 by 0 and a marked 1 by 1. In 
each step, the algorithm chooses a consecutive pair 10, marks it by 10, temporarily deletes 
it, and repeats the process. The algorithm halts when no such consecutive pair is left, i.e., 
the remaining string is of the form 00... 01... 11. We call this stage of the algorithm the 
marking stage, and denote the marked string by mark(a:) G {0,1, 0,1}". Define the signature 
of X, denoted by signature(x) G {0,1, U} as follows: if the bit of x was marked then 
signature(a:)i = Xt and otherwise, signature(a:)i = U. Finally, define Cx to be the collection of 
all strings whose signature is equal to signature(a;). That is all strings y agree with x in the 
marked coordinates of x, and in the remaining coordinates y is of the form 00 ... 01... 11. 

For example, consider the string x = 01100110. In the hrst iteration, the algorithm may 
mark the third and fourth bits to obtain 01100110. Then, the second and hfth bits are 
marked 01100110. Lastly, the rightmost two bits are marked, and we obtain the marked 
string mark(x) = 01100110. Therefore, the signature of x is signature(a;) = UllOO U 10 and 
Cx = { 01100010 , 01100110 , 11100110 }. 

Note that although the algorithm has some degree of freedom when choosing the order of 
marking the 10 pair out of possibly many pairs in a given iteration, the chain Cx is, in fact, 
independent of the specihc choices that were made. That is, signature(x) is a function of 
X, and does not depend on the specihc order in which the algorithm performs the marking. 
An alternative way to see it is to think of I’s as opening parentheses and of O’s as closing 
parentheses, and then mark all maximal sub-sequences of legal parentheses in the given 
string X. As a consequence, we may choose the 10 pairs in any order we wish. We will use 
this fact in the proof of Theorem 1. 

The key part of the proof is the following lemma. We remark that this lemma appears 
implicitly in [BCS14]. 

Lemma 2.1. Let n E N, and let x,y G {0,1}"' be such that dist(x,|/) = 1. Let Cx = 
{cfc, Cfc+i..., Cn-k] and Cy = {c}/, Ck'+i ..., he the chains of the BTK partition that 

contain X and y respectively. Then, dist(signature(a:), signature(?/)) < 3, where dist(-, •) de¬ 
notes the Hamming distance between two strings, that is, the number of coordinates where 
the two strings differ. In particular this implies that 

1. \k — k'\ <1. 

2. If Cj G Cx and c'-, G Cy for some j G [k,n — k] and j' G [k', n — k'], then dist(cj, cL) < 

\j -f \ + 6 . 

In particular, ii x ^ y, then the Hausdorff distance between Cx and Cy is at most 

dniCx, Cy) < 7. 
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Proof. Fix x,y E {0,1}"^ such that they differ only in the ith coordinate and x* = 0 and 
i/i = 1. We may perform the marking stage on each of them in three steps: 

1. Perform the marking stage on the prefix of the string of length i — 1. 

2. Perform the marking stage on the suffix of the string of length n — i. 

3. Perform the marking stage on the resulting, partially marked, string. 

Since x and y = x + ei agree on all but the coordinate, the running of the marking 
stage on x and y in steps 1 and 2 yield the same marking, and so signature(x) agrees with 
signature(j/) in these coordinates, and so, we may ignore the coordinates marked in the first 
two steps. 

Next we analyze the difference between the markings after the third step. Denote by 
s G {0,1,0,!}*“^ and t G {0,1,0,!}”“* the two partially marked strings such that the 
resulting strings after the second step on inputs x and y are so0ot and solot respectively. 
Let us suppose for concreteness that the string s contains a unmarked zeros and b unmarked 
ones, and the string t contains c unmarked zeros and d unmarked ones. Recall that at the 
end of the marking stage, all unmarked zeros are to the left of all unmarked ones in both s 
and t. Therefore, we may assume that 

a: = on'’ o 0 o on'^ and y = 0“l'’ o 1 o OR*'. 

At this point, it is fairly easy to be convinced that dist(signature(x), signature(j/)) is 
bounded by some constant. Proving that the constant is 3 is done by a somewhat tedious 
case analysis, according to the relations between a, h, c and d. 

Claim 2.2. For every a, b,c,d E N, we have 

dist(signature(0“l'’ o 0 o O*’!*'), signature(0“l'’ o 1 o O*’!'’*)) < 3. 

We postpone the proof of the claim for now, and move to the “in particular” part. 
Denote by Ux = {i E [n] : signature(x) = U} the unmarked coordinates of x. The first 
item follows from the fact that \Ux\ = n — 2k, and similarly \Uy\ = n — 2k'. Therefore, if 
dist(signature(x), signature(|/)) < 3 it follows that 2\k — k'\ < \Ux^Uy\ < 3, and hence, since 
k and k' are integers it follows that \k — k'\ < 1. 

For the second item take Cj E Cx and dy E Cy. Then Cj and c', differ in at most 3 
coordinates outside Ux Fl Uy since x and y have almost the same signature except at most 
in three coordinates. Inside the set Ux the string Cj is just a sequence of zeros followed by a 
sequence of ones such that its weight is j, and similarly dy restricted to Uy is a sequence of 
zeros followed by a sequence of ones such that its weight is f . Hence, inside the set UxHUy the 
strings Cj and dy differ in at most \j —j'\ +3 coordinates. Therefore, dist(cj, dy) < |j-/|+6, 
which completes the proof of Lemma 2.1. □ 

We now return to the proof of Claim 2.2. 

Proof of Claim 2.2. Let x = 0“!'’ o 0 o 0'*!*' and y = 0“!'’ o 1 o 0*’!'’'. Our goal is to show that 
dist(signature(x), signature(?/)) < 3. The proof uses the following case analysis. 
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Case 1 {b = c). In this case we have 

X = 0“ o 1^0^’ o Ol'^ and y = 0^ o o 

Their signatures are 

signature(a;) = U“ o 1^0^ o and signature(r/) = o 1^0^ o U'^. 

It is easy to verify that in this case the distance 

dist(signature(0“l'’ o 0 o signature(0“l'’ o 1 o 0^'^)) < 3. 

Case 2 (h > c). In this case we have 

X = 0“!'’-"-^ o r+iQ'+i o 1'^ and y = 0“!^+^-" o ro" o 
Their signatures are 

signature(a:) = u^+^-c-i □ xc+igc+i ^ yd signature(r/) = u“+^-c+i o o U'^. 

It is easy to verify that 

dist(signature(0“l'’ o 0 o signature(0“l'’ o 1 o 0^'^)) < 3. 

Case 3 (h < c). In this case we have 

x = 0“ol^0^o0"-^+^l'^ and ?/= 0“ o o 0"-'’-^'^. 

Their signatures are 

signature(x) = U“ o 1^0^’ o and signature(?/) = U“ o i^+io^+^ o 

It is also easy to verify that in this case the distance is at most 

dist(signature(0“l'’ o 0 o signature(0“l'’ o 1 o 0^'^)) < 3. 

□ 


2.1 The mapping 

In this section we dually prove Theorem 1. In order to prove the theorem it will be convenient 
to partition the hypercube as follows. For each BTK chain C of the (n — l)-dimensional 
hypercube dedne = {c o 6 g {0,1}"': c G C,b G {0,1}}. There is a clear one-to-one 
correspondence between the (n — l)-dimensional BTK chains and our partition of {0,1}". 
For example, the block Pc corresponding to the chain C = {00,01,11} consists of the 
following six elements Pc = {000, 001, 010, 011,110, 111}. 
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Proof. We define the mapping as follows. Let n G N be an odd integer. For x G {0,1}"', 
write it as a: = x'oxn, where x' G {0,1}"'“^ represents the hrst n — 1 bits of x, and Xn G {0,1} 
is the last bit of x. Let C = {ck, c^+i,..., Cn-i-k} be a symmetric chain in the BTK partition 
that contains x', and let j be the index such that x' = cj. Dehne 


. def J 1 O C2j_(n-fc)+x„ if |x| > (n +l)/2; 

[O o C(^ri+k)- 2 j-i-xn if kl <{n- l)/2. 

In order to illustrate the mapping, let us consider as an example the case of n = 3 and 
the block Pq that corresponds to the chain C = {00, 01,11}. 

■^(11 O 1) = 1 o 11, 

V’(iioo) = loOl, 

V’(oioi) = looo. 


?/>(01o0) = OoOO, 
^/>(00ol) = OoOl, 
^(00 oO) = Ooll, 


Note that "0 maps the upper half of (0,1}*^ to points whose hrst coordinate is 1, and 
maps the lower half of {0,1}” to points whose hrst coordinate is 0. It should be mentioned 
that the mapping 0 restricted to the upper half of (0,1}"^ is exactly the mapping used 
in [BCS14], where it was shown that the restriction of 0 to the upper half of (0,1}” is a 
bi-Lipschitz bijection. Similarly, the restriction of 0 to the lower half of {0,1}” is also a 
bi-Lipschitz bijection, and so, as mentioned above, the main difficulty in this construction 
was to “stitch” these two bijections so that the endpoints of the edges in the middle layer 
are mapped by 0 close to each other. 

We next show that the mapping 0 is indeed a 11-Lipschitz bijection from Majority to 
Dictator. Note that by the triangle inequality it is enough to show that for all edges (x, y = 
X + Ci) G {0,1}"' it holds that dist(0(x), dist(j/)) < 11. 

Write X = x' OXn where x' G (0, 1}”“^ represents the hrst n — 1 bits of x, and x„ G (0, 1} 
is the last bit of x. Analogously write y = y' o yn. Let = {ck,Ck+i ■ ■ ■ ,Cn-k} and 
Cy' = { c'y, Cfc'+i..., be the BTK chains that contain x' = Cj and y' = c'/ respectively, 

where j = \x'\ and f = \y'\. Recall that |j — j'| < 1 since \x'\ = j and \y'\ = f and x ~ 

Our goal is to show that dist(0(x), 0(|/)) < 10. We now consider several cases. 


Case 1 (|a;| = (n — l)/2 and \y \ = (n + l)/2). In this case we have 

ijjiyX') 0 O C(^n+k) — {n—l) — l—x„ 0 ® ^k—Xn 

and 

— 1 ° Cn+l-{n-k')+yn — 1 ° • 

By Lemma 2.1 we have \k — k'\ < 1 and hence dist(0(x),0(|/)) < 1 + |(fc — x„) — {k' + 1 + 
yn) \ + 6 < 11, as required. 
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Case 2 (|a;| > (n + l)/2 and \y\ > (n + l)/2). In this case we have |(2j — (n — fc) + 
Xn) — (2j' — {n — k') + |/„)| < 5, and so by Lemma 2.1 the distance between and 
is at most dist(V’(a;),^/;(|/)) = d\st{c 2 j-n+k+x„,c’ 2 j'-n+k'+yj < 5 + 6 = 11. 

Case 3 (|a;| < (n — l)/2 and \y\ < (n — l)/2). This is handled similarly to case 2. 
This completes the proof of Theorem 1 . □ 

3 A Linear Bijection from Dictator to XOR 

In this section we prove Theorem 2. 

Proof of Theorem 2. We give an explicit mapping from Dictator to XOR that is a linear 
transformation over GF{2). Let A be the matrix representing the linear transformation. 
We hrst show that the mapping satishes the conditions in the lemma if A has the following 
properties. 

1. A is invertible. 

2. The hrst column of A has odd weight, and all other columns have even weight. 

3. All rows of A have weight 3. 

4. All columns of A have weight 2. 

5. All columns of A~^ have weight 0(log(?7,)). 

The hrst condition implies that A is a bijection. The second condition implies that A 
maps from Dictator to XOR. To see this, consider XOR(Ar’) for any vector v G {0,1}”. This 
is the sum over GF{2) of the weights of all columns j for which vj = 1. Because the weights 
of all columns but the hrst are 0, they can be ignored, and therefore XOR(Au) = 1 if and 
only if Ui = 1. 

The third condition implies that each output bit is local, as the Ah output bit depends 
only on the Ah row of A. The fourth condition implies that A is 2-Lipschitz. To see this, note 
that dist(x, y) is the weight of x — y, and dist(Ax, Ay) is the weight of A(x — y)- If the weight 
of each column of A is at most C, then the weight of A{x — y) is at most G times the weight 
of {x — y). The same argument applied to A~^ implies that A~^ is 0(log(n))-Lipschitz. 

Note that under the assumption that the mapping is a linear transformation, the above 
is necessary for a mapping to satisfy the conditions of the lemma. 

We now construct the mapping A. Let G be the complete binary tree on n vertices, with 
directed edges so that each points to the child. We uniquely label each vertex with a label 
from 1 to n, with the root labeled 1. Let A be /„, + M where M is the adjacency matrix of 
G. That is Ajj = 1 if i is the parent of j in G or i = j, and Aij = 0 otherwise. 
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We claim that the inverse of A is the matrix dehned as Bi j = 1 if j is a descendant of i, 
and Bij = 0 otherwise. Indeed, consider the (i, j)th entry of the prodnct A ■ B. Then 

n 

■ B)ij = ^ Ai^kBkj 

k=l 

= \{k : i = k or i is the parent of k} H {k : k = j or j is a descendant of k}\ 

\f i = j, these sets have exactly one element in common, i, and therefore {A ■ B)ij = 1. 
If vertex j is not in the snbtree rooted at vertex i, these sets have no vertices in common, 
and therefore {A ■ B)ij = 0. Finally, if vertex j is in the snbtree rooted at i but is not i, 
these sets have exactly two vertices in common, and therefore {A ■ B)ij = 0. 

Because vertex 1 is the only vertex without a parent, the hrst column of A has weight 

1. All other vertices have exactly one parent, and hence all other columns of A have weight 

2. The weight of each row of A is at most 3 since this is equal to the number of children of 

the corresponding vertex plus 1. The Ah column of A~^ is the indicator vector of the set of 
ancestors of z in G including i itself. The size of this set is bounded above by log(n) + 1. 
Therefore, A satishes the conditions of the lemma. □ 

Note that the above proof can be generalized to obtain a mapping (j) that is L-local 
2-Lipschitz such that 0“^ is G-Lipschitz, for any L and G that satisfy {L — 1)*^ > zz by 
replacing the tree G in the proof with a complete L — 1-ary tree on n vertices. Such a tree 
will have height less than G. 

4 Any Bijection from XOR to Majority has Large Average 
Stretch 

In this section we prove Theorem 3. We start with the second part of the theorem. 

Proposition 4.1. There exists a C-Lipschitz bijection tj: {0,1}” {0,1}” from Majority 

to XOR for some absolute constant C. 

Proof. Take the G-Lipschitz bijection from Majority to Dictator from Theorem 1, and compose 
it with the 2-Lipschitz bijection ((){xi, X2, ■ ■ ■, Xn) = {xi+ X2,X2+ x^.,..., Xn-i + Xn, Xn) from 
Dictator to XOR. The resulting mapping is clearly a 2G-Lipschitz bijection from Majority to 
XOR. □ 

Next we prove the hrst part of Theorem 3 showing that any bijection from XOR to 
Majority must have large average stretch. In fact we prove a stronger statement, saying that 
in every direction i G [n] it holds that the average stretch in the direction must be VL{^/n). 

Proposition 4.2. Let (f be a bijection from XOR to Majority, and let i G [n]. Then 
Ea,[dist((/)(a;), (j){x + e,)] > c^/n for some absolute constant c > 0. 
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Proof. Fix i G [n] and consider all edges in the direction Cj, i.e., the edges of the form 
{(x, X + Since XOR(x) 7 ^ XOR(x + Cj) for all x G {0, !}"■ it follows that for every 

such edge one of its endpoints must be mapped to the upper half of the hypercube, and the 
other endpoint to the bottom half of the hypercube. On the other hand, since each level of the 
hypercube contains at most {JJ 2 ) ^ vertices, it follows that for 0.9-fraction of points 

2 ; G {0,1}"' their weight differs from n/2 by more than O.Olydr. Let us call such z typical. 
Therefore, for 0.8 of inputs x it holds that at least one of the endpoints of the edge (x, x-f-e*) is 
mapped to a typical point. Let us say for concreteness that x is typical and is mapped above 
level n/2-I-O.Ol^Ar. Then, since (f){x) 7 ^ (f){x + ei) it follows that (/(x-f-Cj) must belong to the 
lower half of the hypercube, and thus dist((/)(x), 0(x -|- Cj)) > Q{y/n). Therefore, for at least 
0.8 fraction of the edges in the direction e* it holds that dist(0(x), (f){x -|- Cj)) > 0 . 02 -y/n. □ 

This clearly implies Theorem 3. 


5 Bijection from Dictator to a Random Function 

In this section we prove Theorem 4. We start with the following claim. 

Lemma 5.1. Let A C {0,1}" be a random set chosen by picking each x G {0,1}" to be in A 
independently with probability 0.5. Then with probability 1 — there exists an injective 

mapping cfA- {0, 1 }"“^ —)■ {0, 1 }"' such that the following holds. 

1. For all X G {0,1}"“^ it holds that dist(x, 0 a(x)[i^...^„_i]) < 1, where 0A(a;)[i,...,n-i] de¬ 
notes the restriction of to the first n — 1 coordinates. 

2. Pr^^{oA}n-i[(j)A{x) G A] = 1 - 0(l/n). 

We postpone the proof until later and show how to prove Theorem 4 using Lemma 5.1. 

Proof of Theorem 4 . We start with the following simple claim, which is immediate from 
Lemma 5.1. 

Claim 5.2. Let A C {0,1}” be a uniformly random subset of size 2”'“^. Then, with probability 
1 — there exists a bijection 0: {0, l}""”^ —>■ A such that for 1 — 0(l/n) fraction of the 

inputs it holds that dist(x, 0 (x)[i^.,,^„_i]) < 1 . 

Proof. Let us sample a subset Ai C {0,1}"^ of size exactly 2”“^ in the following manner. Pick 
a random subset A C {0,1}" by choosing each x G {0,1}" to be in A independently with 
probability 0.5. Then, ii A < 2"^“^ we add to A uniformly random elements from {0,1}” \ A 
one by one until the size of A becomes 2 "“^. Similarly, if ^4 > 2 ”“^ we remove random 
elements from A one by one until the size of A becomes 2"“^. Let Ai be the obtained set. 
Clearly Ai is indeed a uniformly random subset of {0,1}"^ of size 2"^“^. 

By Lemma 5.1, with probability there is an injective mapping 0: {0,1}"“^ A 

such that dist(x, 0(x)[i_,,,^„_i]) < 1 for all x G {0,1}"'“^, and for all but 0(l/n) fraction of 
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the inputs it holds that 0(a;) G Ai. By the Chernoff bound with probability 1 — the 

sets A and Ai differ in at most most 2"’/n elements. Therefore, we can modify 0 in 0{l/n) 
fraction of the inputs so that the obtained mapping is a bijection from {0,1}”“^ to A that 
satisfies the requirements of the claim. 

□ 


In order to prove Theorem 4 we sample a uniformly random balanced boolean function 
/: {0,1}"' —?■ {0,1} as follows. Pick Ai C {0,1}" of size 2"“^ uniformly at random, and 
let Ao = {0,1}" \ Ai, Define / to be the indicator function of Ai, i.e., Ai = /~^(1) and 
^0 = /~^( 0 ). ^ 
Since Ai is a uniformly random set of size 2 "“^ by Claim 5.2 with probability 1 — 
there is a bijection : {x G {0,1}"“^: Xi = 1} —)■ Ai such that dist(x, (/)i(x)) < 2 for 
all but 0(l/n) fraction of the domain of 0i. Similarly, Aq is also a uniformly random 
subset of {0, 1 }" of size 2"“^, and hence with probability 1 — there is a bijection 

00 : {x G {0,1}"“^: Xi = 0} —)■ Aq such that dist(x, 0o(3^)) < 2 for all but 0(l/'n.) fraction of 
the inputs 

By the union bound with probability 1 — 2 “^”^"' both 0o and 0i exist. Since 0o and 
01 are defined on disjoint domains, whose union is the entire hypercube, we can define 


0 : { 0 , 1 }" ^ { 0 , 1 }" to be 


0 (x) 


01 (x) if Xi = 1 
0 o(x) if xi = 0 . 


Clearly 0 is a bijection from Dictator to / and it satisfies 


Pr [distfx, 0(x)) < 2] = 1 — 0(l/?7,), (2) 

xe{o,i}" 

as required. The “in particular” part of Theorem 4 follows immediately from (2). Indeed, 
by (2) it follows that 1 — 0 (l/? 7 ,) of the edges (x,x + e*) satisfy 

dist(0(x), 0(x + 60) < dist(0(x), x) + dist(x, x + e0 + dist(x + Cj, 0(x + e*)) < 5, 


and therefore 


avgStretch(0) = E 3 .g{o,i}n_ig„[dist( 0 (x), 0(x + e0] < 5 • (1 - 0{l/n)) + n • 0(l/n) = 0(1). 

In order to see that avgStretch(0“^) = 0(1) note that 0 is a bijection, and so by (2) we have 
Pr 3 ,g{o,i}"[dist(x, 0“^(x)) < 2] = 1 — 0 ( 1 / 77 ,). This completes the proof of Theorem 4. □ 

We now return to the proof of Lemma 5.1. The proof relies on an algorithm from [HLN87]. 

Proof of Lemma 5.1. In order to describe the algorithm let A be a random subset of {0,1}". 
For each x G {0,1}"“^ say that x is rich if both x o 0 and x o 1 belong to A, and say that 
X is poor if none of x o 0 and x o 1 belongs to A. If there were no poor vertices in {0,1}"“^ 
then we could define f/A by extending its input x to either x o 0 or x o 1 . However, since 
the subset A is uniformly random, roughly 1/4 fraction of the vertices in {0,1}"“^ will be 
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poor, and we will match all but 0 {l/n) fraction of poor vertices with a neighboring rich 
vertex. Then, we will dehne a mapping 0^ in the following way: (1) if x is neither rich nor 
poor, then dehne (1>a{x) = x o h, where 6 G {0,1} is such that x o b E A, (2) if x is rich, 
then dehne (j)A{x) = x o 1, (3) if x is poor and is matched with a rich vertex y, then dehne 
4>a{x) = 2 / o 1, (4) otherwise, x is poor and is not matched with a rich vertex, in which case 
we dehne ((>a{x) = x o 0. 

Clearly such a mapping cjiA satishes the condition that dist(x, 0A(a^)[i,...,n-i]) < 1 for all 
X G {0,1}"'“^. We will dehne the matching so that only 0{l/n) fraction of the poor vertices 
will be of type (4), i.e., will be poor and not matched to a neighboring rich vertex, and hence 
only those vertices x will be so that (j)A{x) ^ A. 

The algorithm for hnding such a matching is the following. 


Algorithm 1 Matching poor vertices to rich vertices 
1: for i = 1 ..., n/2 do 

2: if X is poor and not matched and x + is rich and not matched then 

3: Match X with x + e* 

4: end if 

5: end for 


Remark 5.3. We remark that we could allow the loop to run until n, however for the analysis 
it will be more convenient to stop after n /2 steps. 

The following two claims from [HLN87] are the key steps in the analysis of the algorithm 
above. 

Claim 5.4 ([HLN87, Lemma 1]). For every k < n/2, the status of x in the kth iteration of 
the algorithm is independent of all vertices that differ from x in some coordinate larger than 

k. 

In particular for any z G {0, 1 }^ let A^ = {x E {0, 1 }"“^: = z}. Then, in the kth 

iteration of the algorithm the status of each vertex in A^ is independent of the others. 

Proof. At each iteration i the vertices that affect each other are matched according to the 
edges in the direction e*. Therefore, any two vertices that differ in some coordinate larger 
than k had no interaction between them, and so are independent of each other. □ 

Claim 5.5. For every x E {0,1}”“^ and i = 1,... ,n/2 let pi be the probability that x is poor 
and unmatched after iteration i. Then 

1 . Pi+i = Pi{l - Pi) for all i < n/2. 

2 . pn /2 < 2/n. 

Proof. Let g* be the probability that x is rich but unmatched after round i, analogous to 
Pi. In the {i + l)st round, x is poor and unmatched if x was poor and unmatched after the 
ith round, and x + Cj+i is rich and unmatched after the ith round. By Claim 5.4, these two 
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events are independent, and therefore = Pi{l — qi). Similarly, can be expressed as 
gj+i = qiil —pi). Subtracting these two equations, we see that —pi+i = qt — Pi = qo — Po 
for all i. This is natural as the difference between the number of rich unmatched vertices 
and poor unmatched vertices stays constant throughout the rounds. Because po = 1/4 and 
go = 1/4, this difference is 0 and therefore g* = pi for all i. Substituting g^ in the expression 
for pj+i yields pj+i = Pi{l — Pi). This proves the hrst part of the claim. 

To prove the second part of the claim, we show by induction that Pi < 1/i for alH > 1. 
Indeed, since po = 1/4 the claim holds for i < 4. For the induction step for i > 4, if 
Pi < 1/* < 1/4, then = pj(l — pi) < — 1/i) < l/(i + 1), as required. 

□ 

We are now ready to complete the proof of Lemma 5.1. For each z G {0,1}"/^ consider 
the set Az = {x & {0,1}"“^: = z}. By Claims 5.4 and 5.5 each a: G is poor and 

unmatched with probability Pn /2 < C/n independently of all other vertices in A^. Therefore, 
since |^^| = 2"^/^, by the Chernoff bound the probability that A^ contains more than a 2Cjn 
fraction of poor and unmatched vertices is at most By taking union bound over 

all z G {0,1}"/^ we conclude that with probability 1 — there exists a matching that 

matches all but 0(l/n) fraction of the proof vertices with a rich neighboring vertex. This 
completes the proof of Lemma 5.1. □ 

6 Open Problems 

Below we list several open problems. 

Question 6.1. In Theorem 2 we constructed a linear mapping from Dictator to XOR that 
is 3-local, 2-Lipschitz such that its inverse is 0{\og{n))-Lipschitz. Is there a mapping from 
Dictator to XOR that is 0{l)-local, and 0{l)-bi-Lipschitz? In particular, it would be inter¬ 
esting to find such a mapping that is non-linear. 

Question 6.2. We proved in Theorem f that for a random balanced function f with high 
probability there is a mapping cff from Dictator to f such that avgStretch((/)) = 0(1) and 
avgStretch((/)“^) = 0(1). Is it true that with high probability there is a bi-Lipschitz mapping 
from Dictator to a random function, i.e., a mapping with bounded worse case stretch? 

Question 6.3. We proved in Theorem 3 that any mapping from XOR to Majority must have 
average stretch larger than Is this bound tight? Is there a mapping cf from XOR to 

Majority such that avgStretch((/)) = o{n)? 

In this paper we only considered mappings between functions with the same domain. If 
we allow one of the functions to have a larger domain, we may relax the requirement that 
a mapping between function must be a bijection, and only require that the mapping be 
one-to-one. Given Theorem 3 we ask the following question. 

Question 6.4. Is there a Lipschitz embedding (j) : {0, !}"■ —)■ {0, l}P°h(fi) XOR(z) = 

Majority((/(; 2 )) for all z G {0, 1}'^? 
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